feat: [WIP] Add support for COUNT(DISTINCT expr)
#2273
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #2273 +/- ##
============================================
+ Coverage 56.12% 57.43% +1.30%
- Complexity 976 1287 +311
============================================
Files 119 146 +27
Lines 11743 13387 +1644
Branches 2251 2377 +126
============================================
+ Hits 6591 7689 +1098
- Misses 4012 4431 +419
- Partials 1140 1267 +127
That looks incredibly useful!
I have test failures with columnar shuffle trying to call unimplemented method
I understand the issue now. For
If I fall back to Spark for the shuffle, I see the same issue. So the problem is perhaps that we report the wrong schema as the output from the partial aggregate.
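To make the schema point concrete, here is a minimal sketch in plain Python (not Comet or Spark code) of a two-phase COUNT(DISTINCT expr). It assumes the partial phase only deduplicates (group key, value) pairs and leaves the counting to the final phase, so the rows leaving the partial aggregate are not counts. The function names partial_distinct and final_count are hypothetical and exist only for this illustration.

```python
from collections import defaultdict


def partial_distinct(partition):
    """Partial phase: deduplicate (group_key, value) pairs within one partition.

    The output is still a set of (group_key, value) pairs, not a count, so the
    schema leaving this phase differs from the schema of the final result.
    NULL values (None here) are dropped because COUNT ignores them.
    """
    return {(key, value) for key, value in partition if value is not None}


def final_count(partials):
    """Final phase: merge the partial sets, then count distinct values per group.

    Simplification: a group whose values are all NULL is omitted here, whereas
    SQL would report a count of 0 for it.
    """
    merged = set().union(*partials)
    counts = defaultdict(int)
    for key, _value in merged:
        counts[key] += 1
    return dict(counts)


# Two input partitions with duplicates inside and across partitions.
partitions = [
    [("a", 1), ("a", 1), ("b", 2)],
    [("a", 1), ("a", 3), ("b", None)],
]

result = final_count(partial_distinct(p) for p in partitions)
print(sorted(result.items()))  # [('a', 2), ('b', 1)]
```

If a downstream operator expected the partial phase to emit a count column instead of the deduplicated pairs, it would read the wrong schema, which is one way a mismatch like the one described above could arise.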
I learned a lot from this WIP PR and have documented my findings in #2292, so I will close this for now.
Which issue does this PR close?
Closes #2292
Rationale for this change
Increase coverage of TPC-H benchmark.
What changes are included in this PR?
Add support for COUNT(DISTINCT expr).
How are these changes tested?